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Systematic Gene Search in the Incyte LifeSeq Database 




Comparison of databases 



normal tissue-specific 
(expected: 100-500) 




tumor tissue-specific 
(expected: 100-500) 



Genes of Interest 



FIG. I 



APPROVED jO^G. FiG 



BY 



CLASS 



draftsman] 



SUBCLASS 



09/673395 



Principle of EST Assembly 
~ 50,000 ESTs per tissue 




Assembly at 0% mismatch 
with GAP4 (Staden) 



Contigs 



Individual Sequences 



Contigs increasing in 
number and length 



Iterative assembly with 
increasing mismatch 
(1%, 2%, 4%) 



5000-6000 Contigs 



~ 25,000 other individual 
sequences 
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~ 30,000 consensus- 
sequences per tissue 
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~ 50,000 ESTs 
of a tissue 
(e.g.: uterus tumor) 



GAP4 Assembly 1st Round: 
minimum initial match: 20 
maximum number of inserted 
blanks per sequence: 8 
maximum percent mis match: 0 



GAP4-Database 1: 
Contigs 1 

Individual Sequences 1 




unassembled 
ESTs 




GAP4 Assembly 2nd Round: 
minimum initial match: 20 
maximum number of inserted 
blanks per sequence^ 8 
maximum percent mis match: 1 




rr — 

GAP4-Database 2: 
Contigs 2 

Individual Sequences 2 




unassembled 
ESTs 




GAP4 Assembly 3rd Round: 
minimum initial match: 20 
maximum number of Inserted 
blanks per sequence: 8 
maximum percent mismatch: 2 
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GAP4-Database 3: 

Contigs 3 

Individual Sequences 3 
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unassembled 
ESTs 




GAP4 Assembly 4th Round: 
minimum initial match: 20 
maximum number of inserted 
blanks per sequence: 8 
maximum percent mis match: 2 




GAP4 Assembly 5th Round: 
minimum initial match: 20 
maximum number of inserted 
blanks per sequence^ 8 
maximum percent mis match: 4 
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GAP4 Assembly 6th Round: 
minimum initial match: 20 
maximum number of inserted 
blanks per sequence^ 8 
maximum percent mismatch: 4 




Assembled database 
of a specific tissue 
(e.g.: uterus tumor) 
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Assembled database 
of a specific tissue 
(e.g.: uterus tumor) 



Consensus 6 



Read-in as individual 
sequences 




Database 
of a specific tissue 
(e.g.: uterus tumor) 




Database of a second 

specific tissue 
(e.g.: normal uterus) 



GAP4 Assembly 
minimum initial match: 20 
maximum number of inserted 
blanks per sequence: 8 
maximum percent mis match: 4 





Tumor tissue- 
specific ESTs 



Non-tissue- 
specific ESTs 



Normal tissue- 
specific ESTs 
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silico subtroction of gene expression in various tissues 



In sili 



'30,000 consensus sequences 
normal tissue 



~ 30,000 consensus sequences 
tumor tissue 



Q 
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Specific genes 




Assembly at 4% mismatch 



Cancer tissue 
Specific genes 



Genes expressed in both tissues 
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Genes of interest 



Determination of tissue-specific 
expression via electronic 
Northern (INCYTE LifeSeq 
and public EST databases) 



Candidate genes for tumor 
suppressors or tumor activators 



FIG. 4a 
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Electronic Northern Blot 
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Fisher's Exact Test 
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Automatic Lengthening 
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Isolation of genomic BAC and PAC clones 



Chromosomal clone localization via FISH 



Hybridization signal 
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Sequencing of clones that are located in regions that have 
chromosomal deletions in prostate and breast cancer leads to 
identification of candidate genes 
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Confirmation of candidate genes by screening of 
mutations and/or deletions in cancer tissues 
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